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ABSTRACT 

We present several different statistical methods to determine the transverse velocity vector of M31. 
The underlying assumptions are that the M31 satellites on average follow the motion of M31 through 
space, and that the galaxies in the outer parts of the Local Group on average follow the motion of the 
Local Group barycenter through space. We apply the methods to the line-of-sight velocities of 17 M31 
satellites, to the proper motions of the 2 satellites M33 and IC 10, and to the line-of-sight velocities 
of 5 galaxies near the Local Group turn-around radius, respectively. This yields 4 independent but 
mutually consistent determinations of the heliocentric M31 transverse velocities in the West and 
North directions, with weighted averages (vw) = — 78 ± 41 kms" 1 and {vn} = —38 ± 34 kms -1 . The 
uncertainties correspond to proper motions of ~ 10 //asyr -1 , which is unlikely to be within reach of 
direct observational verification within the next decade. The Galactocentric tangential velocity of M31 
is 42kms~ 1 , with la confidence interval Vtan < 56kms _1 . The implied M31-Milky Way orbit is bound 
if the total Local Group mass M exceeds 1.72±g;|| x 10 12 M Q . If the orbit is indeed bound, then the 
timing argument combined with the known age of the Universe implies that M — 5.58^q'72 x 10 12 M q . 
This is on the high end of the allowed mass range suggested by cosmologically motivated models for 
the individual structure and dynamics of M31 and the Milky Way, respectively. It is therefore possible 
that the timing mass is an overestimate of the true mass, especially if one takes into account recent 
results from the Millennium Simulation that show that there is also a theoretical uncertainty of 41% 
(Gaussian dispersion) in timing mass estimates. The M31 transverse velocity implies that M33 is in 
a tightly bound orbit around M31. This may have led to some tidal deformation of M33. It will be 
worthwhile to search for observational evidence of this. 

Subject headings: galaxies: kinematics and dynamics — Local Group — M31. 



1. INTRODUCTION 

The Local Group is dominated by two spiral galax- 
ies, M31 and the Milky Way. These galaxies have com- 
parable properties, with M31 generally believed to be 
slightly more massive (e.g., Klypin, Zhao & Somerville 
2002). The next most luminous galaxies, M33 and the 
Large Magellanic Cloud, are some 10 times fainter (e.g., 
van den Bergh 2000). The dynamics and future of the 
Local Group are therefore determined primarily by the 
relative velocity of M31 with respect to the Milky Way. 
Unfortunately, this velocity is poorly known. The line- 
of-sight velocity of M31 can be measured extremely accu- 
rately using the Doppler shift of a large variety of trac- 
ers. However, even after a century of careful attempts 
(starting with, e.g., Barnard 1917) still no useful proper 
motion measurement exists to constrain the transverse 
velocity. This limits our ability to answer several funda- 
mental questions. For example, do M31 and the Milky 
Way indeed form a bound system, as is usually assumed 
(e.g., van den Bergh 1971)? What is the exact mass of 
the Local Group implied by the so-called timing argu- 
ment (e.g., Kahn & Woltjer 1959; Kroeker & Carlberg 
1991; Lynden-Bell 1999)? What is the expected future 
evolution of the M31-Milky Way system (e.g., Cox & 
Loeb 2007)? And how has the structure of M33 been 
influenced by possible interaction with M31 (e.g., Loeb 



et al. 2005; hereafter L05)? 

In the present paper we show that it is possible to ob- 
tain a statistical determination of the transverse velocity 
of M31 using the observed velocities of its satellites. The 
analysis assumes that on average the satellites follow the 
motion of M31 through space, with some velocity dis- 
persion. The ensemble of line-of-sight velocities, and the 
individual proper motions available for selected galaxies, 
then yield independent estimates of the M31 velocity. We 
also revisit the method previously explored by Einasto 
& Lynden-Bell (1982), which is based on the assumption 
that the galaxies in the outer parts of the Local Group 
follow the motion of the Local Group barycenter through 
space. We apply the different methods to the currently 
available data for the relevant Local Group galaxies, and 
we combine the results to obtain an accurate determi- 
nation of the M31 transverse velocity. We then use this 
determination to address the aforementioned questions. 

The structure of the paper is as follows. Section [2] 
discusses the constraints on the M31 transverse velocity, 
based on: the line-of-sight velocities of an ensemble of 
17 M31 satellites fSection l2.1|) : the recent high accuracy 
proper motion determinations of the M31 satellites M33 
and IC 10 by Brunthaler et al. (2005, 2007) from VLBI 
observations of water masers (Section 12 . 2[) ; and the line- 
of-sight velocities of an ensemble of galaxies in the outer 
parts of the Local group (Section 12. 3p . The different 
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constraints are compared and combined in Section 12.41 
Section [3] discusses the implications of the inferred M31 
velocity for the relative orbit of M31 and the Milky Way 
(Section 13. ip and for the relative orbit of M31 and M33 
(Section 13. 2p . Section |4] discusses how the results for the 
M31 transverse velocity (Section 14. ip . the total mass of 
the Local group as implied by the timing argument (Sec- 
tion [472]) , and the Local Group turn-around radius (Sec- 
tion 321) compare with theoretical predictions and other 
observational studies. Section [5] presents the conclusions 
of our work. 

In the analysis below we use several coordinate sys- 
tems. Observational systems have their three principal 
axes aligned with the line-of-sight, West, and North di- 
rections, respectively, for a given position on the sky. 
We also use a Galactocentric coordinate system centered 
on the Milky Way and a barycentric coordinate system 
centered on the Local Group barycenter. The analysis 
requires transformations between the positions and ve- 
locities in these systems. Many of the necessary nota- 
tions and derivations can be found in van der Marel et 
al. (2002; which presents a study of the kinematics of 
the Large Magellanic Cloud), and are given here with- 
out further reference. Heliocentric velocities are gener- 
ally denoted with a vector v, Galactocentric velocities 
with a vector V, and Local-Group barycentric velocities 
with a vector U. Velocities and proper motions in the 
observational systems are generally heliocentric (i.e., not 
corrected for the reflex motion of the Sun), unless stated 
otherwise. 

2. THE M31 TRANSVERSE VELOCITY 

2.1. Constraints from Line- of- Sight Velocities of M31 

Satellites 

The velocity vector of an M31 satellite galaxy can be 
written as the sum of the M31 velocity vector, and a 
peculiar velocity 



TABLE 1 
M31 Satellite Galaxy Sample 



«sat = "M31 + V\ 



pec- 



(1) 

We assume that any one-dimensional component of v pcc 
is a random Gaussian deviate with dispersion a. More 
formally, this is true if the velocity distribution of the 
satellites is both isotropic and isothermal. The first 
assumption finds some support from studies of galaxy 
satellite systems (e.g., Kochanek 1996) and clusters of 
galaxies (e.g., van der Marel et al. 2000). The second 
assumption is reasonable in view of the fact that the 
gravitational potentials of dark halos are approximately 
logarithmic. Either way, in the present analysis there is 
only a very weak dependence on the accuracy of these 
assumptions (by contrast to studies of the mass distribu- 
tion of M31; e.g., Evans et al. 2000). 

The velocity vector of M31 can be described by quanti- 
ties v sys , v t and ©t, where v sys is the line-of-sight velocity, 
v t is the transverse velocity, and Ot is the position angle 
of the transverse motion on the sky. The velocities in the 
directions of West and North are 

v w = v t cos(e t + 90°), v N = v t sin(e t + 90°). (2) 

The position of a satellite on the sky can be described by 
the angles (p, $), where p is the angular distance from 
M31 and <& is the position angle with respect to M31 
measured from North over East (as defined in van der 
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deg 


deg 


kms - 1 


(1) 


(2) 


(3) 


(4) 


(5) 


M31 


Sb I-II 


0.00 




-301 ± 1 


M32 


E2 


0.40 


-179.10 


-205 ± 3 




dSph 


0.60 


-46.61 


-244 ± 3 


And IX 


dSph 


2.69 


43.32 


-211 ± 3 


And I 


dSph 


3.27 


169.84 


-380 ± 2 


And III 


dSph 


4.97 


-163.13 


-355 ± 10 


And X 


dSph 


5.62 


48.96 


-164 ± 3 


NGC 185 


dSph/dE3 


7.09 


-5.07 


-202 ± 7 


NGC 147 


dSph/dE5 


7.43 


-12.29 


-193 ± 3 


And V 


dSph 


8.03 


35.31 


-403 ± 4 


And II 


dSph 


10.31 


136.82 


-188 ± 3 


M33 


Sc II-III 


14.78 


131.78 


-180 ± 1 


And VH/Cas 


dSph 


16.17 


-47.93 


-307 ± 2 


IC 10 


dlrr 


18.37 


-9.11 


-344 ± 5 


And Vl/Peg 


dSph 


19.76 


-143.64 


-354 ± 3 


LGS 3/Pisces 


dlrr/dSph 


19.89 


165.44 


-286 ± 4 


Pegasus 


dlrr/dSph 


31.01 


-143.37 


-182 ± 2 


IC 1613 


dlrr V 


39.44 


171.30 


-232 ± 5 



Note. — The sample of M31 satellites used in the model- 
ing of Section r2.ll M31 itself is listed on the first line for com- 
parison. Column (1) lists the galaxy name and column (2) its 
type. Columns (3) and (4) define the position on the sky: p 
is the angular distance from M31 and $ is the position angle 
with respect to M31 measured from North over East. These 
angles were calculated from the sky positions (RA,DEC) as 
in van der Marel et al. (2002). The satellites in the table are 
sorted by their value of p. Column (5) lists the heliocentric 
line-of-sight velocity and its error. Velocities and sky posi- 
tions (RA,DEC) for most satellites were obtained from the 
compilation of Evans et al. (2000), except for the satellites 
And IX and X which had not yet been discovered in 2000. 
For those we used the velocity measurements and (RA,DEC) 
given by Chapman et al. (2005) and Kalirai et al. (2007), 
respectively. 

Marel & Cioni 2001; van der Marel et al. 2002). Because 
satellites are not located on the same position on the sky 
as M31, the vector VM31 has a different decomposition 
in line-of-sight and transverse components than it does 
for M31. More specifically, the line-of-sight velocity of a 
satellite is 



^sat.los 



; cos p + v t sin p cos(<l> — O t 



(3) 



The factor cos p in the first term indicates that only a 
fraction of v sys is seen along the line-of-sight. The second 
term is an apparent solid body rotation component in the 
^sat.ios velocity field on the sky, with amplitude Vt sin p 
and kinematic major axis along position angle O t . The 
last term merely adds a scatter a on top of the velocity 
field defined by the first two terms. 

It follows from equation ([3]) that the transverse velocity 
of M31 affects the line-of-sight velocities of its satellites. 
Therefore, a study of the line-of-sight velocities of the 
satellites can constrain the M31 transverse velocity. To 
this end we compiled the sample in Table [TJ which con- 
sists of M31 satellites with known velocities. The sample 
is based on that used by Evans et al. (2000), but with ad- 
dition of the more recently discovered Andromeda dwarf 
satellites And IX and X. The satellites And XI and XIII 
(Martin et al. 2006, who also reports the finding of an 
unusually distant globular cluster) and And XV and XVI 
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(Ibata et al. 2007) are not included because no line-of- 
sight velocity measurements have yet been reported for 
them. We fitted equation §5§ to these data by determin- 
ing the values of (v sys , vyy, ujv) that minimize the scat- 
ter in tJpec,k>s' The resulting scatter is the dispersion a 
of the satellite population. After the best-fitting model 
was identified, we calculated error bars on the fitted pa- 
rameters using Monte-Carlo simulations. Many different 
pseudo data sets were created with the same satellites at 
the same positions, but with velocities drawn from the 
best-fit model, with i> p0 c.ios drawn as a random Gaussian 
deviate with dispersion a. The pseudo data sets were 
then analyzed similarly as the real data set. The disper- 
sions in the inferred model parameters are a measure of 
their formal 1-a error bars. 

The modeling procedure yields v sys = — 270±19kms~ 1 , 
Vw = —136 ± 148 kms -1 , — — 5 ± 75 kms -1 , and 
a = 76 ± 13 kms -1 (see also Table [3] below) . The data 
and representative predictions of the best-fitting model 
are shown in Figure [TJ Color-coding indicates the dis- 
tance from M31. The inferred transverse velocity cor- 
responds to sinusoidal variations that are less than the 
observed scatter in the data. No sinusoidal variation with 
angle is discernible in the data by eye; indeed, the case 
of zero transverse velocity (amplitude zero for the sinu- 
soidal variations) is also statistically consistent with the 
data. However, this is not a null result. Large transverse 
velocities of hundreds of km s -1 would have induced large 
sinusoidal variations that are not seen in the data. Such 
transverse velocities are therefore ruled out. We note 
that we could also have kept v sys fixed in the fit at the 
observed value of v sys — —301 ± 1 kms -1 (Courteau & 
van den Bergh 1999). We verified that such a fit yields 
verify similar results, namely v\y = —123 ± 159 kms -1 
and djv = —33 ± 79 kms -1 , which is well within the un- 
certainties quoted above. 

Our analysis does not assume that the satellites are 
necessarily bound to M31. For M33, orbit calculations 
do suggest that it probably is bound (see figure 3 of Loeb 
et al. 2005), but this has not been established with confi- 
dence for most of the other satellites. We decided not to 
include the recently discovered dSph galaxies And XII 
and And XIV in our sample. They lie at similar posi- 
tion angles, $ = 171.94° and 170.49°, respectively, and 
at distances p — 6.96° and 11.71°, that are not atypi- 
cal for the rest of our sample. However, their velocities 
of —556 ± 5 and —478 ± 5 kms -1 , respectively, are 3.3<r 
and 2.3a away from the observed velocity of M31. It has 
been suggested that they form a system that is falling 
into M31 for the first time (Chapman et al. 2007; Ma- 
jewski et al. 2007), which would not make them useful 
additions to our analysis. As a test we did repeat our 
analysis with these galaxies included in the sample. This 
yielded u sys = -296±27kms -1 , v w = -85±200kms -1 , 
v N = 41 ± 108 kms' 1 , and a = 106 ± 18 kms -1 . This is 
consistent with the result for our main sample to within 
the uncertainties. This illustrates that the results are 
fairly robust against the inclusion or removal of individ- 
ual galaxies. By contrast, modeling of the parent galaxy 
mass based on satellites can be quite sensitive to assump- 
tions about the bound state of individual galaxies (e.g., 
Kochanek 1996). 
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Fig. 1. — Comparison of heliocentric line-of-sight velocity data 
of M31 satellites and the predictions of equation for the best- 
fit values of the heliocentric M31 transverse velocity (vyy,vpf) = 
(—136, -5)kms _1 . The curves show the predictions as function of 
position angle <t>, for angular distances from M31 of p = 3° (red), 
p = 9° (cyan), p = 15° (green), p = 29° (black). Data points 
are from Table [T] They are color coded based on bins in angular 
distance p, from 0-6° (red), 6-12° (cyan), 12—18° (green), and 18- 
40° (black). Predictions with zero transverse velocity (amplitude 
zero for the sinusoidal variations) are also statistically consistent 
with the data. However, a large transverse velocity for M31 of 
hundreds of km s — 1 (corresponding to sinusoidal variations of much 
larger amplitude than show here) are ruled out. 

2.2. Constraints from Proper Motions of MSI 
Satellites 

Water masers can be observed at high spatial reso- 
lution with VLBI techniques. This makes them a valu- 
able tool for proper motion studies of Local Group galax- 
ies. Brunthaler et al. (2005, 2007) recently determined 
the proper motions for two galaxies in the M31 group, 
namely M33 and IC 10. Unfortunately, no water masers 
have yet been found in M31 and, at present detection 
limits, may be none should be expected (Brunthaler et 
al. 2006). So the direct application of this technique to 
M31 may not be possible in the foreseeable future. How- 
ever, the measurements for M33 and IC 10 can be used 
to constrain the transverse velocity of M31 indirectly. 
Equation {1} implies that the unknown velocity vector 
of M31 can be estimated from the known velocity vector 
of a satellite as VM31 — ^sat — v pcc . Since the peculiar 
velocity is unknown, it acts as a Gaussian uncertainty of 
size a in each velocity component. 

In analogy with equation ([3]), one can write for the 
transverse velocity components of the satellite 



^sat,2 : 
^sat,3 : 



y sys 



smp 



-Vt sin($ 



h Vt cospcos($ 
S t ) + v poc . 3 . 



e t ) 



y pcc,2, 



(4) 



Here the unit vectors 2 and 3 on the plane of the sky are 
related to the directions of West and North, all at the 
position of the satellite, according to a rotation 



V S nt,W 
V S at,N 



-sinT 
cosT 



cosT 
sinT 



^sat,2 
^sat,3 



(5) 



where the rotation angle T is determined by 

cos r = [sin S cos Sq cos(a — «o) — cos 5 sin So] / sin p, 
sinr= [cos<5osin(a — chq)] /sin p. (6) 
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Here (a, 5) are the RA and DEC of the satellite, and 
(ao,6o) are the RA and DEC of M31 (i.e., the position 
with p = 0). 

Given values of (u sat ,los, Usat.Wj «sat,iv), the equa- 
tions ([2j) — ([6]) uniquely constrain the three unknown com- 
ponents (w S ys, %, wjv) of the M31 velocity vector. We 
solve these equations for each of the two satellites M33 
and IC 10. We take v sa t,ios from Table Q] To obtain 
w sat,w and w sat.N we write each velocity component as 
v = (0.0047404Z?^ o bs) — Sv mt , where /z b s is the ob- 
served proper motion in /zasyr , D is the satellite 
distance in kpc, and Sv mt is a correction for the in- 
ternal rotation of the galaxy under study. We take 
D = 794 ±23 kpc for M33 (McConnachie et al. 2004) and 
D = 660 ±65 kpc for IC 10 (Evans et al. 2000). The other 
quantities follow from Brunthaler et al. (2005, 2007): 
(Atobs,w,Atobs,N) = (-4.7 ± 3.2, -14.1 ± 6.4) ^asyr -1 for 
M33; (Mob s ,w,Mob s , N ) = (-6.0 ± 5.0, 23.0 ± 5.0) ^asyr" 1 
for IC10; (<5u ro t,w, <Kot,N) = (70 ± 23, -81 ± 23) kms" 1 
forM33; and (5v Tot ,w, Sv Tot ^) = (-25±19, 9±19)kms- 1 
for IC 10. We do not include the Brunthaler et al. cor- 
rections for the reflex motion of the Sun, since we deal 
with that issue separately in Section [3~T1 We add Gaus- 
sian random deviates in our calculations to reflect the 
uncertainties. We take each component of v pcc to be a 
Gaussian random deviate with dispersion a = 76kms _1 , 
as determined in Section 12.11 For each combination of 
(w S at,ios,Wsat,w,^sat,N) we solve the equations to obtain 
(v sys , vw, v n) and we repeat this in Monte-Carlo fash- 
ion. We adopt the average and dispersion of the results 
as our final estimate for the M31 velocity vector and its 
error. Using M33, we obtain the following estimates for 
M31: v sys = -183 ±76 kms" 1 , v w = -48±80kms- 1 , 
ujv = 71 ± 84 kms" 1 . Using IC 10, we obtain the fol- 
lowing estimates for M31: u sys = —346 ± 76 kms -1 , 
vw = —16 ± 80 kms -1 , = —47 ± 81 kms" 1 (see also 
Table [3] below). 

2.3. Constraints from Line-of-Sight Velocities of Outer 
Local Group Galaxies 

The Local Group contains not only the virialized sub- 
groups of galaxies surrounding the Milky Way and M31, 
but also a number of unattached galaxies that populate 
the outer regions of the Local Group. On average, these 
galaxies are expected to follow the motion of the Local 
Group barycenter through space. Their heliocentric line- 
of-sight velocity vectors, averaged in a three-dimensional 
sense, therefore statistically equal the reflex motion of the 
Sun with respect to the Local Group barycenter. Since 
the velocity of the barycenter is itself determined by the 
relative velocity of M31 with respect to the Milky Way, 
this yields a determination of the latter. Variations of 
this method have been applied in the past by, e.g., Yahil, 
Tammann, & Sandage (1977), Lynden-Bell & Lin (1977) 
and Einasto & Lynden-Bell (1982). It is now worthwhile 
to revisit this method, since available information on the 
membership and distances of Local Group galaxies has 
evolved significantly in the past decades. 

We adopt a Cartesian coordinate system (X, Y, Z) , 
with the origin at the Galactic Center, the Z-axis point- 
ing towards the Galactic North Pole, the X-axis pointing 
in the direction from the sun to the Galactic Center, and 
the Y-axis pointing in the direction of the sun's Galac- 



tic Rotation. For the set of outer Local Group galaxies 
i = 1 , . . . , N we calculate the unit vector fj in the direc- 
tion of each galaxy. If the velocity vector of the Sun with 

respect to the Local Group barycenter is Uq, then one 
has 

^( w i°s,* + [U Q ■ n])n = 0, (7) 

i 

where t>i os ,i is the heliocentric line-of-sight velocity for 
each galaxy. This can be written as a 3 x 3 matrix equa- 
tion for the components of the vector Uq. The best- 
fit values and their formal errors are easily obtained us- 
ing standard techniques (Einasto & Lynden-Bell 1982). 
Once this solution is obtained one can calculate the ve- 
locity of the Milky Way with respect to the Local Group 
Barycenter, Umw = Uq — Vq. Here Vq is the veloc- 
ity vector of the Sun in the Galactocentric rest frame. 
For the circular velocity of the Local Standard of Rest 
(LSR) we use the standard IAU value Vq = 220 kms -1 
(Kerr & Lynden-Bell 1986), to which we assign an uncer- 
tainty 10 kms -1 (none of our results depend sensitively 
on this quantity). For the residual velocity of the Sun 
with respect to the LSR we adopt the values of Dehnen 
& Binney (1998). If we assume that all of the mass of the 
Local Group resides in the Milky Way and M31, then the 
barycenter is simply the mass-weighted average of their 
position vectors. This implies in the Galactocentric rest 
frame that the velocity of M31 is Um31 = — t^Mw//M3i, 
where /m3i = M M 3i/(Mm31 + M M w)- The heliocentric 
velocity of M31 is therefore um31 = Um31 — Vq. After 
substitution of the previous equations this yields 

VM31 = -EW/M31 + Vq[(1//m3i) - !]• (8) 

This heliocentric vector can be decomposed into compo- 
nents along the line-of-sight and in the West and North 
directions following the methodology of van der Marel et 
al. (2002). 

Application of equation ([8]) requires that we assume 
a value for /m31- Einasto & Lynden-Bell (1982) used 
the Tully-Fisher relation to constrain this quantity, and 
built this constraint directly into their matrix solution 
for Uq. However, the mass ratio of M31 and the Milky 
Way isn't actually all that well known observationally, 
and different arguments for estimating it have yielded 
different results. Our aim here is to constrain VM3i from 
observational data, while imprinting a minimum amount 
of theoretical prejudice into the result. So we simply 
assume that /m31 is homogeneously distributed between 
0.39 and 0.67 (i.e., M M3 i/M M w = 0.8-2.0). This encom- 
passes most of the values that have been quoted in the 
literature (see discussion in Section [472]) . We then solve 
equation ([5]) in Monte-Carlo fashion, while simultane- 
ously adding in the random errors in Uq and Vq. This 
yields both the best fit result for VM3i and its statistical 
uncertainties. 

We base our analysis on the sample of Local Group 
dwarf galaxies (there are no giant galaxies in the outer 
parts of the Local Group) compiled by Grebel, Gallagher, 
& Harbeck (2003). From their table 1 we removed all 
galaxies listed as being (potentially) part of the Milky 
Way or M31 subgroups. We obtained the heliocentric 
line-of-sight velocities of the galaxies from the NASA 
Extragalactic Database (NED). We removed the Cetus 
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TABLE 2 

Outer Local Group Galaxy Sample 



Name Type RA DEC i>i os D £>bary 

deg deg kms -1 kpc kpc 

(1) (2) (3) (4) (5) (6) (7) 



WLM dlrr IV-V 0.49234 -15.46093 -122 ± 2 945 ± 40 802 

Aquarius/DD0210 dlrr/dSph 311.71585 -12.84792 -141 ± 2 950 ± 50 940 

Leo A dlrr V 149.86025 30.74639 24 ±... 800 ± 40 953 

Tucana dSph 340.45667 -64.41944 130 ± . . . 870 ± 60 1068 

Sag DIG dlrr V 292.49573 -17.67815 -79 ± 1 1060 ± 100 1152 



Note. — The sample of outer Local Group galaxies used in the modeling of Section \2. 3 1 Column (1) lists the galaxy name 
and column (2) its type. Columns (3) and (4) give the position on the sky. Column (5) lists the heliocentric line-of-sight velocity 
and, where available, its error. Velocities and sky positions (RA,DEC) were obtained from the NASA Extragalactic Database 
(NED). Column (6) lists the heliocentric distance and its error, from Grebel et al. (2003). Column (7) lists the Local Group 
barycentric distance Dbary, calculated assuming the M31 distance listed in Section [3.11 and a mass fraction /m31 = 0.53. The 
galaxies in the table are sorted by their value of -Dbary 



dwarf from the sample, since it has no line-of-sight ve- 
locity available. We restricted our primary sample to 
galaxies with a Local Group barycentric distance less 
than ~ 1.2 Mpc. The resulting sample consists of 5 
galaxies, which are listed in Table O With this sample 
the analysis yields for M31: v sys = —405 ± 114 kms -1 , 
vw = — 126 ± 63 km s -1 , vjy = — 89 ±50 km s^ 1 (see also 
Table [3] below). The fit yields an estimate a = 22kms _1 
for the one-dimensional dispersion of the galaxies around 
the space motion of the Local- Group barycenter. The 
small value of this dispersion is due to the fact that 
the sample galaxies reside at an average Local Group 
barycentric distance of 0.98Mpc, which is consistent with 
the Local Group turn-around radius (e.g., Karachentsev 
et al. 2002), where the velocity with respect to the Local 
Group barycenter is zero by definition. 

When comparing the analyses in Section 12.11 and the 
present section, there is an important difference in the 
expected galaxy velocities. The velocities of satellites 
around M31 are virialized, so the expectation value of a 
satellite velocity with respect to M31 is zero, independent 
of where the satellite is located on the sky. By contrast, 
the motions of the outer Local-Group galaxies around 
the Local Group barycenter are not virialized. Therefore, 
the expectation value of a galaxy velocity with respect to 
the Local Group barycenter is zero only if the galaxy is 
near the turn- around radius. If this is not the case, then 
equation ((TJ is valid only if the galaxies are distributed 
homogeneously around the Local Group. In reality, the 
distribution is both non-homogeneous and the number 
of galaxies is small. Therefore, addition to the sample 
of galaxies beyond the turn-around radius is expected 
to add both bias and shot-noise to the estimate of the 
M31 velocity. Moreover, galaxies significantly outside 
the turn-around radius do not necessarily need to follow 
the Local Group barycenter motion. 

The sample in Table [2] consists of a rather small num- 
ber of galaxies. So despite the aforementioned disad- 
vantages, we did study the effect of adding more Local 
Group galaxies at larger barycentric distances. In par- 
ticular, we tried to add the only other 6 Local Group 
galaxies within 2 Mpc (namely: NGC 3109, Antlia, Sex- 
tans A and B, IC 5152 and KKR 25; Grebel et al. 2003) 
that are not believed to be associated with any other 



TABLE 3 

M31 Heliocentric Velocity Estimates 



Method 


^sys 


v w 


v N 




kms -1 


kms 


km s 


(1) 


(2) 


(3) 


(4) 


M31 Satels. 


-270 ± 19 


-136 ± 148 


-5 ± 75 


M33 PM 


-183 ± 76 


-48 ± 80 


71 ± 84 


IC 10 PM 


-346 ± 76 


-16 ± 80 


-47 ± 81 


Outer LG Gals. 


-405 ±114 


-126 ± 63 


-89 ± 50 


Weighted Av. 


-273 ± 18 


-78 ± 41 


-38 ± 34 



Note. — Estimates of the heliocentric velocity of M31 es- 
timated using different methods, as indicated in column (1). 
The method based on the M31 satellite ensemble line-of-sight 
velocities is described in Section 12.11 and that based on the 
observed proper motions (PMs) of M33 and IC 10 is described 
in Section [272] and that based on the line-of-sight velocities of 
the satellites in outer regions of the Local Group is described 
in Section [2. 31 Column (2) lists the estimated M31 systemtic 
line-of-sight velocities. Columns (3) and (4) lists the esti- 
mated M31 transverse velocities in the West and North di- 
rections, respectively. The bottom line of the table lists the 
weighted average of the results from the different methods. 

nearby structures. These reside at barycentric distances 
of 1.6-1.9 Mpc. Analysis of the combined sample of 
11 galaxies yields for M31: v sys = —608 ± 154 kms -1 , 
v w = -82 ± 138 kms -1 , v N = -46 ± 82 kms" 1 . The fit 
yields an estimate a = 50kms _1 for the one-dimensional 
dispersion of the galaxies around the space motion of the 
Local-Group barycenter. The large deviation of v sys from 
the observed value as well as the increased a and formal 
errors support our assertion that adding these distant 
galaxies decreases the quality of the results. Nonethe- 
less, the results for vw and vn are consistent with those 
inferred from the smaller sample, within the errors. So 
we conclude that the results for the M31 transverse mo- 
tion are quite robust, and not very sensitive to the com- 
position of the sample. This is further supported by the 
fact that our results are consistent within the errors with 
the preferred solutions obtained by Einasto & Lynden- 
Bell (1982), despite their use of a sample that is only 
partially overlapping with ours. 

2.4. Comparison and Combination of Constraints 
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The vw and vn for M31 inferred from the different 
methods and listed in Table [3] are shown in Figure [2] 
as colored data points with error bars. The weighted 
averages of all four of the independent estimates are 
(v w ) = -78 ± 41 km s -1 and (v N ) = -38 ± 34 kms" 1 . 
This is shown in the figure as a black data point with 
error bars. 

The x 2 that measures the residuals between the indi- 
vidual measurements in Table [3] and the weighted av- 
erages is x 2 — 8.1 for N — 9 degrees of freedom (12 
measurements minus 3 parameters). Therefore, the re- 
sults for the vw and Vn from the different methods are 
consistent within the errors. Among other things, this 
implies that there is no evidence that the dispersion of 
the peculiar velocities of M31 satellites in the transverse 
direction, which enters into the analysis of Section 12.21 
is larger than the value a = 76kms _1 derived from line- 
of-sight velocities in Section [2~T1 This is consistent with 
the assumption that was made about the isotropy of the 
peculiar velocities. 

The inferred weighted average of the three independent 
systemic velocity estimates is (v sys ) = — 273 ± 18kms _1 . 
This differs at the 1.6a level from the observed M31 ve- 
locity v sys = — 301±lkms _1 (Courteau & van den Bergh 
1999), but the agreement in an absolute sense is better 
than the formal uncertainties in ((i>vk}> ( v n))- So this 
comparison provides no reason to mistrust our assump- 
tions that the M31 satellites (Sections 12.11 and 12. 2p and 
the outer Local Group galaxies (Sect ion l2~3f move on av- 
erage through space with the same velocity as M31 and 
the Local Group barycenter, respectively (see Lynden- 
Bell 1999 for an earlier discussion of this). 

The analyses in the previous sections make assump- 
tions about the velocity distributions of the satellites, but 
not about their spatial distributions. These spatial dis- 
tributions are known to be inhomogeneous. In the case of 
the M31 subgroup, there are more satellites on the near 
side of M31 (i.e., between M31 and the Milky Way) than 
on the far side (e.g., McConnachie & Irwin 2006). More- 
over, recent studies have suggested that (some of) the 
satellites are concentrated near a plane surrounding M31 
(e.g., Koch & Grebel 2006). These facts by themselves do 
not affect our analysis at all, as long as the velocity dis- 
tributions remain isotropic. However, the analysis would 
be affected if the satellite ensemble possessed a mean ro- 
tation. This is possible in some of the scenarios that 
have been suggested for a possible disk-like distribution 
of satellites (but not necessarily the favored scenarios; 
see, e.g., Metz, Kroupa, & Jerjen 2007). In the method 
of Section l2~T1 we fit an apparent solid-body rotation field 
(eq. [3]) to the observed satellite velocities. Any intrinsic 
rotation would therefore bias the inferred transverse ve- 
locity. However, if this had been the case then we might 
have expected the M31 transverse velocity results of this 
method to be inconsistent with the results from the other 
methods that we have used, and in particular the method 
of Section l2~3l based on outer Local Group galaxies. The 
fact that the results from the different methods are ac- 
tually statistically consistent therefore suggests that our 
results are not affected by potential rotation of the M31 
satellite system. 

In the comparisons of the different estimates listed in 
Table [3] it should be noted that they are not completely 
independent, because the line-of-sight velocities of M33 
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Fig. 2. — Estimates of the M31 heliocentric transverse velocity 
in the West and North directions. Data points with error bars 
are from Table [3] based on the M31 satellite ensemble line-of-sight 
velocities (blue), the proper motion of M33 (green), the proper 
motion of IC 10 (red), the line-of-sight velocities of outer Local 
group satellites (magenta) , or the weighted average of these results 
(black). The starred symbol indicates the transverse velocity that 
corresponds to a radial orbit for M31 with respect to the Milky 
Way. The cyan rectangle approximates the region that is ruled 
out in all of the theoretical models explored by Loeb et al. (2005), 
because the resulting relative orbit of M31 and M33 would have 
produced more disruption of the M33 disk than is observed. The 
small dots are the 18% of 30,000 samplings from the error ellipse 
belonging to the black data point that are consistent with the the- 
oretical constraint. 

and IC10 enter not only in the analysis of Section 12.21 
but also in that of Section 12.11 However, this is not 
a very important issue. We verified that if M33 and 
IC10 are removed from the line-of-sight velocity analysis 
in Section 12.11 then the changes in the results are well 
within the uncertainties. 

3. ORBITS 
3.1. M31 -Milky Way Orbit 

To calculate the velocity of M31 in the Galactocen- 
tric rest frame we adopt the same Cartesian coordinate 
system (AT, Y, Z) as in Section 12.31 We adopt a dis- 
tance D = 770 ± 40 kpc for M31 (Holland 1998; Joshi et 
al. 2003; Walker 2003; Brown et al. 2004; McConnachie 
et al. 2005; Ribas et al. 2005). The position of M31 is 
then r = (-379.2, 612.7, -283.1) kpc. To calculate the 
reflex motion of the Sun at the position of M31 we use 
the same solar velocity as in Section 12.31 and we use the 
standard IAU value Rq = 8.5 kpc for the distance of 
the Sun from the Galactic Center (Kerr & Lyndcn-Bell 
1986), to which we assign an uncertainty of 0.5 kpc (none 
of our results depend sensitively on this quantity). The 
velocity of the Sun then projects to (v sys , vyy , «at)o = 
(172, 128, 71) kms" 1 at the position of M31. Since one 
observes the reflex of this, these values must be added 
to the observed M31 velocities to obtain its velocity in 
the Galactocentric rest frame. The velocity vector corre- 
sponding to the observed velocity component v sys given 
in Tableland the inferred ((v\y), (vn)) given in Table [3] 
is then V ohs = (97 ± 35, -67 ± 26, 80 ± 32) kms" 1 . The 
errors (which are correlated between the different com- 
ponents) were obtained by propagation of the errors in 



M31 Transverse Velocity and Local Group Mass 



7 



the individual position and velocity quantities (including 
those for the Sun) using a Monte-Carlo scheme. 

If the transverse velocity of M31 in the Galactocen- 
tric rest frame, Van, equals zero, then M31 moves 
straight towards the Milky Way on a purely radial 
(head-on collision) orbit. This orbit has (ww, w Jv)rad = 
(—127, —71) kms -1 (this is approximately the reflex of 
the velocity of the Sun quoted above, because the lines 
from the Sun to M31 and from the Galactic Center to 
M31 are almost parallel). The radial orbit is indicated 
as a starred symbol in Figure [2j The velocity V>bs cal- 
culated in the previous paragraph has tangential and 
radial components Van,obs = 59kms~ 1 and Vad,obs = 
-130 km s" 1 . The total velocity is |V obs | = 142 km s -1 . 

The value V ra d,obs is an unbiased estimator of the 
true radial velocity V a d- The associated uncertainty 
can be calculated in straightforward fashion using the 
previously described Monte-Carlo scheme. This yields 
Vad = —130 ± 8 kins" 1 . The uncertainty is due mostly 
to the uncertainty in our knowledge of the circular ve- 
locity of the LSR in the Galactic plane. The values of 
Vtan,obs and |Vobs| are more difficult to interpret, because 
they are not unbiased estimators of the true velocities 
Vtan and V|. This is because the area coverage of Van 
values in the (vw, vn) plane scales as 27rVan^Van, which 
produces a bias in the sense that any measurement er- 
ror in (vw,vn) tends to yield overestimates of Vtan and 
|V|. To quantify and correct these biases we used Bayes' 
theorem, which yields the identity 

P(Van|V an , obs ) cx P(V an )P(V an ,ob 8 |Van) (9) 

We are interested in the quantity on the left-hand side, 
which is the probability distribution of Van, given our 
measurement. The quantity P(V an ) on the right-hand 
side is the Bayesian prior probability of Van, which we 
assume to be flat (i.e., homogeneous). The quantity 
-P(Vt an ,obs| Vtan) is the probability of measuring a value 
Vtan,obs if the actual value is Vtan- This latter distribution 
is easily calculated using Monte-Carlo drawings, because 
the measurement uncertainties are known. Once the dis- 
tributions P(Van,obs|Van) have been pre-calculated for 
all Vt arl , it is straightforward to obtain a Monte-Carlo 
sampling of the probability distribution P(Van|Van,obs)- 
To this end one draws a random deviate Vtan, and then 
accepts this value with probability P(V an ,obs| Van)- The 
top left panel of Figure [3] shows the probability dis- 
tribution -P(Vtan|Vtan,obs) thus obtained. The median 
Vt an = 42 kms" 1 . The fact that this is smaller than 
Vtan.obs = 59kms~ 1 quantifies the aforementioned bias. 
The lcr confidence interval is Van < 56kms _1 . So the 
radial orbit is consistent with the data at this confidence 
level. These results are discussed in the context of pre- 
vious model predictions in Section [4.11 When combined 
with the value for Vad, the ltr confidence interval around 
the median for the total velocity is V = 1381]^ kms" 1 . 

To get insight into the relative orbit of M31 and the 
Milky Way we assume that they can be approximated 
as point masses of mass Mm3i and Mmw, respectively. 
In the center-of-mass frame, their orbit then has en- 
ergy E = i/i|Vp — G/jM/\r\ and angular momentum 

L = n\r x V|, where the total mass is Af = Mm31 + Mmw 
and the reduced mass is [i = Mm_3iMmw / M . The galax- 



ies are bound to each other (E < 0) if M > A/bind, min = 

\r \ | V\ 2 /2G. The orbit of the separation vector r is then a 
Kepler ellipse with eccentricity e 2 = l + (2PL 2 /G 2 Af 2 /j 3 ) 
and semi-major axis length a = L 2 /[GA//i 2 (l — e 2 )]. The 
pericenter separation is r pcr i = a/(l — e). The period is 
T = 2n(a 3 /GM) 1 / 2 . The orbit can be parameterized us- 
ing the eccentric anomaly rj as (e.g., Kibble 1985; Binncy 
& Tremaine 1987) 

r = a(l — e cos rj), 

t={a 3 /GM) 1/2 (r) - e sin 77). (10) 

In this parameterization rj = corresponds to the peri- 
center passage at t = 0, while r) = 2n corresponds to the 
next pericenter passage at t = T. The radial and tan- 
gential velocities can be similarly parameterized as (e.g., 
Kochanek 1996) 

V rad = (GM/a) 1 / 2 (esim 1 )/(l - ecosrj), 

V^n = (GM/a)^ 2 (l - e 2 ) 1 / 2 /(l - ecos V ). (11) 

It is generally assumed that the Local Group is a bound 
system that, due to its overdensity, decoupled from the 
Hubble expansion at fairly high rcdshift. Since then the 
orbital evolution of M31 and the Milky Way has been 
governed by Newtonian dynamics. Given this scenario 
and the orbital description given by equation (jTQI) , the 
Big Bang must have corresponded to a previous pericen- 
ter passage, which we can take to be t = 0. The current 
time t then corresponds to the age of the Universe, which 
has been tightly constrained using data from the Wilkin- 
son Microwave Anisotropy Probe to be t — 13.73+Q'jjGyr 
(Spergel et al. 2007). With measurements of the current 
Galactocentric distance r and velocities Vad and Van, 
the equations (flQ)) and (fTTj) can be solved for the quanti- 
ties Af , a, e and 77. The quantity rj must be in the interval 
[ir, 2ir] (so that M31 and the Milky Way are falling to- 
wards each other for the first time), since unplausibly 
high masses M are otherwise required. 

This methodology for modeling the Local Group is 
commonly called the "timing-argument" . It has been 
widely applied and discussed in the literature (e.g., Kahn 
& Woltjer 1959; Lynden-Bell 1981, 1999; Einasto & 
Lynden-Bell 1982; Raychaudhury & Lynden-Bell 1989; 
Kroeker & Carlberg 1991; Kochanek 1996; Loeb et 
al. 2005), mostly to obtain joint estimates for the age 
of the Universe and the mass of the Local Group, often 
assuming a purely radial orbit. We can now do a more 
accurate analysis, both because we have an observational 
estimate of V an , and because the age of the Universe can 
be assumed to be well-known from independent data. We 
have used this methodology on our results. Monte-Carlo 
error propagation was performed as described above, to 
obtain full probability distributions of M , a, e, rj, T, 
and r pcr i. The results are shown as histograms in Fig- 
ure [3l The inferred la confidence intervals around the 
median are: M = 5.58±g;f| x 10 12 M Q ; a — 561±fJ kpc; 

77 = 4.30lto;o45 radians; r pcri = 23 kpc, with lcr confi- 
dence interval r pcri < 40.9 kpc; T = 16.70t°;2g G Y r ; and 
e = 0.959, with lcr confidence interval (1 — e) < 0.072. 
The implications of this result for M are discussed in 
Section ^. 21 The uncertainties in the listed quantities are 
due primarily to the uncertainties in V an and the M31 
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Fig. 3. — Probability histograms of: M31 Galactocentric tangential velocity Vtan; minimum total mass M^ind min of M31 and the Milky 
Way if the galaxies are bound; and assuming a bound orbit and the timing argument, the M31-Milky Way orbital pericenter distance r per ;, 
orbital eccentricity e, orbital semi-major axis length a, orbital period T, current eccentric anomaly rj, total mass M t i m i ng , and implied 
turnaround radius ro for test-particles on radial orbits around the M31-Milky Way system. The probability distributions were obtain using 
Monte-Carlo simulations as described in the text. The vertical scales are arbitrary. The blue curves take into account all observational 
uncertainties in the distances and velocities of both M31 and the Sun in the Galactocentric rest frame, as well as the observational 
uncertainties in the age of the Universe. The red curves also enforce the theoretical exclusion zone of Loeb et al. 2005 (cyan region in 
Figure [2}, within which more tidal deformation of M33 would have been expected than is observed. 



distance D. The uncertainties in these quantities con- 
tribute more or less equally to the uncertainties in M, a, 
T, and 77. The uncertainties in the distance have little 
effect on e and r por i. All quantities vary monotonically 
with V4an and D. Larger values of Vtan yield larger values 
of M , a, rj, T, and r pcr i, and smaller values of e. Larger 
values of D yield larger values of M , a, e, T, and r pcr j, 
and smaller values of r\. Figure[3]also shows the probabil- 
ity distribution of A/bind. min, the minimum mass required 
for a bound orbit. The inferred la confidence intervals 
around its median is M bin d,min = l.Tlt^l x 10 12 M . 

The anticipated collision between M31 and the Milky 
Way will happen at the next orbital pericenter, which 
in the Kepler model is at T — t « 3.0 ± 0.3 Gyr from 
now. The orbital pericenter distance is much smaller 
than the sizes of the galaxies' dark halos. If the galax- 
ies are assumed to have a logarithmic potential that re- 
produces the observed rotation curve amplitudes, then 
their radial extents r t are 235(A/ M31 /3.4 x 10 12 M ) kpc 



and 207(M M w/2-3 x 10 12 M Q ) kpc, respectively (Loeb et 
al. 2005). The dark halos will intersect once the orbital 
separation becomes smaller than the sums of these sizes. 
This will have two consequences. First, the orbits will 
deviate from Kepler ellipses in the sense that the orbit 
will have less curvature and larger r por i than indicated by 
the previously derived Kepler orbit. Second, there will be 
dynamical friction, which will tend to increase the cur- 
vature and will tend to decrease r perl . More complicated 
calculations are necessary to properly calculate the orbit 
once the dark halos of the galaxies start to overlap and 
to study the properties of the resulting merger. Such cal- 
culations were recently presented by Cox & Loeb (2007). 
However, they adopted an orbit with Vtan — 132 kms , 
^pcri = 450 kpc, and e = 0.494. Comparison to the top 
panel of Figure [3] shows that this orbit is not consistent 
with our measurement of the M31 transverse velocity 

3.2. M31-M33 Orbit 
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Loeb et al. (2005; hereafter L05) recently derived a 
theoretical constraint on the transverse motion of M31 
from the fact that M33 is relatively undisturbed. This 
appears to rule out orbits where M33 had a previous 
close interaction with M31. The exact shape of the re- 
gion in {vw,vn) space thus ruled out (being defined by 
L05 as: more than 20% of the M33 stars would have been 
stripped) has a complex shape and depends somewhat on 
the modeling assumptions, but it can be approximated 
by the solid cyan rectangle in Figure^ This is an approx- 
imation to figure 2c of L05 (which quantified the trans- 
verse motion using Galactocentric rest-frame velocities 
V a cosS — —vw — 128 kms -1 and Vg — wjv + 71 kms -1 ). 
In some of their models (see their figures 2a, b) even a 
somewhat larger region is ruled out. 

The overall shape of the excluded region in (vw,vn) 
space can be understood with fairly simple calculations 
and arguments. In doing so, we ignore the observa- 
tional uncertainties in the quantities of interest. This 
is sufficient for the scope of the present discussion, but 
should be included for more quantitative understanding 
of the M31-M33 orbit. Based on the data from Ta- 
ble Q] and the M33 and M31 distances from Sections 12.21 
and l3.11 respectively, the distance between M33 and M31 
is 203kpc. The center of the excluded region in Figure[2]is 
at (vw,vn) ~ (— 68, — ll)kms _1 . This defines the three- 
dimensional velocity vector of M31, while for M33 that 
vector is known from Brunthaler et al. (2005). The radial 
and tangential velocity components of the separation vec- 
tor are then T^ad = —71 krns" 1 and Vt an = 126 kms -1 . 
The Kepler orbit of the separation vector can be cal- 
culated similarly as in Section 13.11 In doing so, we 
assume that Mm3i = 3.0 x 10 12 , which is based on 
M = 5.58 x 10 12 M Q (SectionEnj and / M3 i « 0.54 (com- 
pare Section |2~3")) . We assume that the M33-to-M31 mass 
ratio is equal to the value ~ (118/250) 4 suggested by the 
Tully-Fisher relation and the galaxy's circular velocities 
(Corbelli & Schneider 1997; Klypin et al. 2002). The or- 
bit then has a — 120kpc, r pcr i = 27kpc, T = 2.2 Gyr, and 
e = 0.77. The value of a is close to the minimum that 
this quantity can attain (i.e., maximum binding energy) 
as a function of (vw, Vjv)> which is a mm — 113kpc. Given 
the value of a, M33 moves inside of the M31 dark halo 
for most of its orbit. Our assumption that all of the M31 
mass resides at its center therefore overestimates the cur- 
vature of the orbit. The listed pericenter distance should 
thus be interpreted as a lower bound on the actual value. 
More detailed calculations, as in L05, are required to get 
a proper estimate of this quantity. 

As one moves from (vw,Vn) ~ (— 68, — 11) kms -1 
along the diagonal that runs from the bottom left to 
the top right in Figure [21 the Kepler orbit value of r per i 
doesn't change much. However, a increases, which means 
that the orbits become more eccentric and less bound 
(and ultimately unbound). The galaxies therefore spend 
less time in close vicinity of each other, and the rela- 
tive velocity at pericenter increases. By contrast, as one 
moves from (vw, ww) ~ (—68, —11) krns -1 along the di- 
agonal that runs from the top left to the bottom right in 
Figure[2j the Kepler orbit value of e doesn't change much. 
But again, a increases, which means that the orbits have 
larger pericenter separations and become less bound (and 
ultimately unbound). The energy dissipated during an 
encounter scales as dE oc l/(r A V 2 ) in the impulsive tidal 



approximation (e.g., Binney & Tremaine 1987, eq. [7- 
55]), where r is the impact parameter. Therefore, as one 
moves away along either diagonal in Figure [2J the struc- 
ture of M33 will be less perturbed by the encounter. The 
excluded region in Figure [2] can therefore be understood 
to lowest order as the region where the orbital integral 
over dE exceeds some threshold, with r and V calculated 
on the basis of the Kepler orbit. 

Interestingly, the observationally implied transverse 
velocity of M31 from Table [3] falls right in the region for 
which considerable tidal deformation of M33 would have 
been expected. This M31 velocity yields a Kepler orbit 
for the M31-M33 separation vector with a = 127 kpc, 
Tpcri = 30 kpc, T = 2.4 Gyr, and e = 0.76. However, 
the uncertainties in the observationally implied velocities 
cannot be ignored. Upon performing Monte-Carlo sam- 
pling we find that as much as 18% of samplings from the 
error ellipse fall outside the cyan rectangle in Figure [2l 
and therefore do not violate the M33 tidal stripping ar- 
gument. These 18% are shown as small black dots for a 
total sample of 30,000 drawings. The black dots can be 
viewed as a visual representation of the probability dis- 
tribution of M31's transverse velocity obtained by taking 
into account not only the observational constraints de- 
rived here, but also the theoretical M33 stripping argu- 
ment of L05. Since most of the dots fall close to the cyan 
rectangle, it is likely that there has been some tidal de- 
formation of M33 by M31 (although not sufficient to pass 
L05's threshold for being considered excluded). This sug- 
gests that it will be worthwhile to perform deep searches 
for tidal tails and structures in the outer regions of M33, 
similar to those that have already been performed for 
M31 (e.g., Ferguson et al. 2002). 

Our results for the relative M31-M33 orbit involve a 
small amount of circular reasoning, since we have as- 
sumed a priori that the residual space motion of M33 
with respect M31 has the same dispersion (a = 76kms _1 
per coordinate, as derived from line-of-sight velocities in 
Section 12. ip as do the other M31 satellites. To avoid 
this circular reasoning one could ignore the result labeled 
"M33 PM" in Table [3] (green cross in Figure [2]). The 
weighted average of the remaining M31 transverse mo- 
tion estimates then becomes (vw) = — 89±47kms _1 and 
(v N ) = -60 ± 37 krns" 1 . This differs by only 24 krns" 1 
from what we have used so far (last line of Table [3]), 
and this difference is well within the uncertainties. This 
modified result still falls well inside the region excluded 
by L05. Therefore, our conclusions about the M31-M33 
orbit are not influenced by the fact that we have used 
the M33 space velocity as one of the estimators of the 
M31 space velocity. 

L05 only considered models with Mm3i = 2.6-3.4 x 
10 12 M©. As is discussed in Section 14.21 below, it is 
possible that the M31 mass is actually lower than this. 
For example, Klypin et al. (2002) advocate Mm3i — 
1.6 x 10 12 Mq. It follows both from the simple argu- 
ments of Section 13.21 and from the detailed calculations 
of L05 (their figure 2) that the amount of past tidal de- 
formation of M33 is smaller for smaller values of Mmsi • 
This would reduce the area of the excluded rectangular 
region in Figure [21 and would reduce the concern that 
our observational estimate of the M31 transverse veloc- 
ity falls in the region of parameter space that was disfa- 
vored by L05. Also, the models of L05 (and hence the 
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cyan region in Figure [2]) do not account explicitly for the 
observational uncertainties in the assumed distances of 
M31 and M33. If the actual distances differ at the 1 or 
2a level from the canonical values, then this could affect 
the location of the excluded region in transverse- velocity 
space. 

It is straightforward to include the theoretical con- 
straint of L05 into the M31-Milky Way timing argument 
calculations of Section 13.11 To address this, we applied 
the same Monte-Carlo scheme as in that section, but 
now with rejection of all Monte-Carlo drawn velocities 
with (vw,vn) combinations in the region excluded by 
L05. This yields the probability histograms shown in 
red in Figure [31 The distributions of Vtan, e and r pcr j be- 
come rather non-Gaussian, as can be easily understood 
from the distribution of points in Figure [21 However, 
the other distributions remain close to Gaussian, and are 
not very different from those obtained without using the 
L05 M33 stripping argument. The same is true for the 
inferred la confidence intervals around the median val- 
ues. For example, for the Local Group mass we obtain 
M = 5.50t^76 x 1Ql2 M ©' which is similar to the result 
of Section 13.11 

4. DISCUSSION 
4.1. M31 Tangential Velocity 

Many previous studies of the M31-Milky Way system 
have assumed that their orbit can be approximated to be 
radial (Vtan ~ 0). This simplifies analyses based on the 
timing argument and, in the absence of a reliable Vtan 
measurement, was a reasonable guess based on simple 
cosmological arguments. In the absence of mutual grav- 
itational interactions, peculiar velocities with respect to 
the Hubble flow decrease with time as (1 + z). In the 
M31-Milky Way system there is mutual gravitational at- 
traction along the galaxy separation vector. This changes 
the radial velocity Vrad from positive (receding) at high z 
to negative (approaching) at the present time, as quan- 
tified by the timing argument. However, the angular 
momentum is conserved in a two-body system without 
external perturbations. Therefore, a significant present- 
day tangential motion in such as system implies an un- 
realistically high peculiar velocity at high redshift. 

The situation is more complicated when the possibil- 
ity of angular momentum exchange is taken into account. 
Tidal torques can lead to exchange between the spin an- 
gular momentum of the galaxies and their orbital angu- 
lar momentum. This process may have contributed both 
to the observed spins of M31 and the Milky- Way, and to 
the tangential velocity component in their orbital motion 
(Gott & Thuan 1978; Dunn & Laflamme 1993). More im- 
portantly, tidal torques exerted by the galaxies outside of 
the Local Group also induce a tangential velocity com- 
ponent in the M31-Milky Way system (Raychaudhury 
& Lynden-Bell 1989). A useful approach to study this 
effect, and more generally, the orbits of all the galaxies 
in the nearby Universe, is based on the principle of least 
action (Peebles 1989). This assumes that nearby galax- 
ies arrived at their present configuration through grav- 
itational interactions from a nearly homogeneous high- 
rcdshift state with negligible peculiar velocities. Allowed 
solutions are those that minimize the relevant Hamilto- 
nian action integral. This method can be used to fit ob- 



served galaxy velocities starting from their observed po- 
sitions (Peebles et al. 1989; Peebles 1994), to fit observed 
galaxy distances starting from their observed velocities 
(Shaya, Peebles, & Tully 1995; Schmoldt & Saha 1998), 
or to fit observed galaxy velocities and distances simul- 
taneously (Peebles et al. 2001, hereafter P01; Pasetto & 
Chiosi 2007). The general prediction from the theoretical 
work that includes tidal torques is that M31 tangential 
velocities of Vtan S 200 kms -1 are expected in plausible 
models. 1 

Figure 6 of P01 shows the M31 transverse velocity vec- 
tors predicted in 30 minimum-action solutions for the 
nearby Universe. The reason that there are multiple 
possible solutions is due to the absence of observational 
knowledge of most galaxy proper motions. P01 char- 
acterized the M31 velocity using supergalactic angular 
coordinates in the Galactocentric rest frame. These are 
related to the heliocentric transverse velocity vt and the 
position angle ©t of the transverse motion on the sky, 
defined as in equation ([2]), according to 

v SGL = -v t cos(O t + 34.60°) - 131 km s -1 , 

v SGB = Vt sin(9 t + 34.60°) -64 km s -1 . (12) 

Our weighted average velocity ((%}, (vn)) given in Ta- 
ble [3] corresponds to (wsgl,^sgb) = (— 55, — 21) kms -1 . 
The 68.3% confidence ellipse around this measurement 
encloses 8 of the 30 viable solutions presented by POL 
Our measurement is therefore fully consistent with their 
theoretical work. The action method is based on the as- 
sumptions that mass follows light and that the galaxy 
peculiar velocities are due to their mutual gravitational 
interactions. Our M31 transverse motion determination 
therefore provides no reason to doubt these assump- 
tions (although cosmological N-body simulations sug- 
gest that these assumptions are at best only approxi- 
mately satisfied; Martinez- Vaquero, Yepes, & Hoffman 
2007). The mass Mm3i + Mmw assumed in the P01 
models is 5.16 x 10 12 M Q , which is consistent with the 
range calculated in Figure [3] based on the timing argu- 
ment. The mass was not varied independently in P01, 
but is within the factor ~ 2 range of masses for which 
the action method yields plausible solutions (Peebles et 
al. 1994). Pasetto & Chiosi (2007) obtained a best-fit 
solution from their action modeling that corresponds to 
a heliocentric velocity (vw,vn) = (— 142, — 41) kms -1 . 
This value is near the edge of our 68.3% confidence el- 
lipse, and is therefore also consistent with our measure- 
ment. The mass .Mm 31 + Mmw assumed in their models 
is 5.36 x 10 12 M©, which is also within the range calcu- 
lated in Figure [3] based on the timing argument. 

In summary, our measurement of the transverse veloc- 
ity of M31 is consistent with the most recent theoretical 
models. Moreover, the fact that 73% of POl's action- 
method solutions do not fall within our 68.3% confidence 
ellipse suggests that our measurement has sufficient ac- 
curacy to provide meaningful constraints on the allowed 

1 The mass M = Mm3i + Mmw calculated with the timing 
argument is a monotonically increasing function of Vtan- So if 
one pre-assumes an upper limit to M, then one also obtains an 
upper limit to Vtan- L05 used this approach to obtain Vtan ^ 
120 kms -1 based on the assumption that M < 5.6 X 10 12 Mg. 
However, no physical motivation was provided for this assumed 
mass limit. Larger values of Vtan are not inconsistent with the 
timing argument, but they do require higher masses. 
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solution space of the Local Group orbits and its history. 

4.2. Local Group Mass 

In Section 13.11 we calculated that the minimum M = 
Mm31 + Mmw required for a bound orbit is Mbind,rain = 
1.72±o 25 x 10 12 M and that for a bound orbit the tim- 
ing argument implies that M = 5.58+°:?2 x 1Ql2 M o- 
These results are based on point mass dynamics, where 
the point masses are proxys for entire the Milky Way sub- 
group of galaxies and the entire M31 subgroup of galaxies 
(i.e., the parent galaxy plus its satellites), respectively. 
Only a negligible fraction of the light in the Local Group 
comes from galaxies that are not part of either of these 
subgroups (see Table [5]) . Therefore, the mass M is a 
proxy for the total mass of the Local Group. 

It is of interest to compare our mass estimates to those 
derived using independent arguments. Kochanek (1996) 
modeled the mass distribution of the Milky Way using 
equilibrium models for the velocities of Milky Way satel- 
lites, with the circular velocity of the disk and the es- 
cape velocity of stars in the solar neighborhood as addi- 
tional constraints. This yielded a fairly well constrained 
mass of (0.5 ± 0.1) x 10 12 M Q inside of 50 kpc. How- 
ever, the total Milky Way mass depends on the extent of 
the dark halo, which is poorly constrained by any data. 
Kochanek's solutions therefore allowed total masses any- 
where from ~ 1-5 xlO 12 M (see his figure 7). Wilkin- 
son & Evans (1999) obtained a similarly large range from 
their models, M = 1.9~ti 7Xl0 12 M Q , for the same reason. 
Both of these studies used rather arbitrarily parameter- 
ized density profiles. Klypin, Zhao & Somerville (2002; 
hereafter K02) improved this situation by restricting the 
discussion to models with ACDM cosmologically moti- 
vated density profiles and concentrations. This yielded 
a much more tightly constrained total mass. Their fa- 
vored solution (their model Al) has a total virial mass 
of 1.0 x 10 12 M Q . The highest-mass model that is still con- 
sistent with the observational and theoretical constraints 
(their model A4; see also Besla et al. 2007) has a total 
virial mass of 2.0 x 10 12 M Q . 

Evans & Wilkinson (2000) and Evans et al. (2000) 
modeled the mass distribution of M31 using equilibrium 
models for the velocities of M31 satellites and halo trac- 
ers. This yielded M = 0.8tjj x 10 12 M©, suggesting 
that M31 be less massive than the Milky Way. However, 
as for the Milky Way modeling, the allowed mass range 
was large due to uncertainties in the dark halo extent. 
Again, K02 improved this situation by imposing theoret- 
ical constraints from ACDM modeling. This yielded a 
favored model (their model Bl) with a total virial mass 
of 1.6 x 1O 12 M . The ratio M M31 /M MW = 1.6 for the fa- 
vored models in the K02 study (this implies /m3i = 0.62, 
as defined in Section [2~3]) . This is similar to the value im- 
plied by the ratio of the galaxy's circular velocities and 
the Tully-Fisher relation, ~ (250/220) 4 = 1.67 (Einasto 
& Lynden-Bell 1982). 

K02 did not explore what maximum mass might still be 
consistent with the observational data for M31. However, 
to lowest order one may guess that the situation is simi- 
lar as for the Milky Way, where the highest allowed mass 
is twice the favored mass. The maximum Local Group 
mass for the K02 models is therefore ~ 5.2 x 10 12 M , 
while the favored mass is M = 2.6 x 10 12 M . This as- 



sumes that the derived virial masses of the Milky Way 
and Andromeda apply to their entire subgroups of satel- 
lites. This is reasonable, given the uncertainties already 
inherent in the estimates. The largest satellites of both 
subgroups (the Large Magellanic Cloud and M33, re- 
spectively) have < 10% of the luminosity of their parent 
galaxy, and most other satellites contribute < 1%. Both 
the favored and the maximum mass of K02 exceed the 
value M b i„d,mm = l-72io 2 5 x 10 12 M derived in Sec- 
tion EIU This suggests that the Milky Way and M31 are 
indeed a bound system, as is usually assumed. 

We showed that the timing argument for a bound M31- 
Milky Way system implies a mass M — 5.58+o'72 x 
10 12 M . This is consistent with the maximum mass al- 
lowed by the K02 models, but not with the favored mass. 
Even the tail of the probability distribution for A/timing 
in Figure [3] doesn't go as low as M = 2.6 x 10 12 M . 
The best agreements are obtained for the case of nearly 
radial orbits and a small M31 distance. For example, 
V tw = and D = 700kpc yields M t i ming = 4.4 x 1O 12 M . 
But while this Vtan is consistent with our measurements, 
such a low distance is almost 2er away from the value 
D = 770 ± 40 kpc inferred using a wide range of methods 
(Holland 1998; Joshi et al. 2003; Walker 2003; Brown et 
al. 2004; McConnachie et al. 2005; Ribas et al. 2005). 

The timing argument provides a rather simplistic view 
of the evolution of a binary galaxy in a cosmological con- 
text. Several authors have therefore quantitatively stud- 
ied its accuracy. Models that address the gravitational 
influence and torques of structures outside of the Local 
Group have generally concluded that these do not signif- 
icantly bias the mass estimates obtained from the timing 
argument (Raychaudhury & Lynden-Bell 1989; Peebles 
et al. 1989, 2001; Pasetto & Chiosi 2007; see also Sec- 
tion |4T]). However, these models generally treat individ- 
ual galaxies as point masses. So this does not address the 
fact that the galaxies themselves assemble hierarchically 
over time. 

To include hierarchical assembly it is necessary to per- 
form more detailed dynamical calculations. Kroeker & 
Carlberg (1991) studied the accuracy of the timing argu- 
ment by examining binary galaxies found in an A^-body 
simulation of a closed CDM Universe. They found that 
when Vtan is available, the timing argument provides an 
unbiased estimate of the total mass (measured in two 
spheres, centered on each galaxy, of radii one half the 
inter-galaxy separation). But estimates based on radial 
orbits yielded masses that were on average too low by a 
factor ~ 1.7. They also found that the galaxies must be 
bound for the timing argument to work, since it overes- 
timates the actual mass of unbound pairs. The indepen- 
dent mass estimates of the Milky Way and M31 discussed 
above suggest that the galaxies are in fact bound, so this 
should not be an issue here. Kroeker & Carlberg also 
show that the timing argument may not work well for 
pairs that are not isolated. This too should not be much 
of an issue for the M31-Milky Way system, given that 
the Sculptor, M81, and Maffei groups are all some 3Mpc 
away. 

Goldberg (2001) reached somewhat different conclu- 
sions than Kroeker & Carlberg. He modeled binary 
galaxies using a perturbative least action approach in 
which the galaxies are part of a continuous fluid that 
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collapses over the course of the simulation. The results 
were cross-validated against an N-body simulation, with 
consistent results. From his calculations he concluded 
that the timing argument tends to overestimate the to- 
tal mass of a pair by a factor of ~ 2. The reason for 
the discrepancy between this result and that of Kroeker 
& Carlberg is unclear. However, Goldberg's calculations 
are quite idealized, while Kroeker & Carlberg's calcula- 
tions are now quite dated (both in terms of numerical 
sophistication and underlying cosmology) . A more mod- 
ern theoretical investigation of the accuracy of timing ar- 
gument was therefore presented recently by Li & White 
(2008). 2 

Li & White identified galaxy pairs in the so-called 
Millennium Simulation with properties similar to the 
Milky Way-M31 system. The high quality and detail 
of this cosmological simulation provides excellent statis- 
tics, with thousands of pairs available for study. They 
found that the radial- orbit timing mass estimator pro- 
vides an unbiased estimate of the total mass of a pair 
(defined as the sum of the M200 values, where M200 is 
the mass inside the sphere over which the mean density is 
200 times that of the Universe at large). With their pre- 
ferred selection cuts, they found the ratio of the true to 
the estimated mass to be A — 0.99±0.41, where we trans- 
formed their quoted interquartile range to a traditional 
Gaussian dispersion. The simulations do not confirm the 
finding of Kroeker & Carlberg (1991) that the bias in 
the estimate depends on Vtan, but they do indicate that 
the dispersion in the estimate increases with T4an- Either 
way, the radial-orbit analysis of Li & White is reasonable 
for the Milky Way-M31 system, since we have found 
that a radial orbit is consistent with the available ob- 
servational constraints. The calculated "theoretical un- 
certainty" exceeds the uncertainties calculated here (see 
Figure [3]) that result from propagation of observational 
errors. This theoretical uncertainty therefore drives the 
accuracy which we the Local Group mass is known from 
the timing argument. The preferred Local Group mass of 
K02 corresponds to a ratio A — 0.47. This is 1.2a away 
from timing mass that we have calculated (with a being 
the quadrature sum of the theoretical and observational 
uncertainties). Therefore, the two results are quite con- 
sistent. K02 quote a mass uncertainty of a factor ~ 2, 
which is not too dissimilar from the timing mass uncer- 
tainty calculated by Li & White (2008). Therefore, it is 
not a priori clear which estimate is to be preferred. 

4.3. Local Group Turn-Around Radius 

An independent mass estimate is obtained from the 
turn-around radius ro of the Local Group, defined as 
the distance from the Local Group barycenter at which 
galaxies have a radial velocity of zero (e.g., Lynden-Bcll 
1981). Using the same Keplerian formalism as in Sec- 
tion 13.11 galaxies on the turn-around surface have 77 = n 
and Urad = 0. Equation (|10)1 therefore gives that 

r = [GA/(t/^) 2 ] 1/3 (l + e), (13) 

where, as before, t is the age of the Universe and ro is 
usually evaluated for radial orbits (e = 1). Figure [3] 
shows the probability distribution of the predicted ro 

2 This paper came to our attention after the original submission 
of our own work. 



thus obtained, with M taken from the timing argument. 
The inferred la confidence interval around the median is 
ro = l-56±g;g? Mpc. 

The ro predicted from the M31-Milky Way timing 
argument can be compared to the observationally in- 
ferred value, ro = 0.94 ± 0.10 Mpc (e.g., Karachentsev 
et al. 2002). These values are not consistent. Since ro oc 
A/ 1 / 3 , the observed ro implies a Local Group mass of only 
M = (1.3 ± 0.3) x 10 12 M Q . Similar results are obtained 
when this method is applied to other nearby groups 
(Karachentsev et al. 2005). This mass is lower than that 
implied by the timing argument (as stressed previously 
by Lynden-Bcll 1999) and it is also lower than that im- 
plied by almost all studies of the individual masses of the 
Milky Way and M31 (see Section |4~2")) . Moreover, com- 
parison of this mass to Af b ind,min = l>72l2;|s x 10 12 M Q 
from Section 13.11 suggests that M31 and the Milky Way 
then do not form bound pair. This would make the Lo- 
cal Group little more than a chance superposition of two 
spiral galaxies. Such a configuration has fairly low a pri- 
ori probability, given the local density of spiral galaxies 
(van den Bergh 1971). 

The agreement between the observed and predicted 
turn-around radii improves if the galaxies defining the 
turn-around surface are not on radial orbits. A value 
e = 0.2 in equation (fT3")) would yield perfect agreement. 
However, the corresponding orbits are then nearly cir- 
cular and this would produce a considerable thickness 
in the zero-velocity surface which is inconsistent with 
observations (Lynden-Bell 1999). So a more reasonable 
conclusion may be that mass estimates based on the turn- 
around radius are systematically biased towards low val- 
ues. Direct evidence for this comes from the action mod- 
eling of, e.g., Peebles et al. (2001). Their models repro- 
duce both the distances and redshifts of the galaxies in 
the outer regions of the Local Group (see Table ^ , de- 
spite the use of a Local Group mass M = 5.16 x 10 12 M 
that significantly exceeds the mass implied by equa- 
tion (fl3)) . This suggests that the isolated collapse picture 
on which the concept of a turn-around surface is based 
may not be suitable for the Local Group. The under- 
lying reason may be that the dynamical structure near 
the edges of the Local Group is significantly influenced 
by structures in the mass distribution on larger scales. 
It should also be noted that the even in idealized models 
the zero- velocity surface is not actually expected to be a 
sphere, but an elongated ellipsoid (e.g., figure 6 in Pee- 
bles et al. 1989). This is not generally taken into account 
in observational determinations of ro- 

5. CONCLUSIONS 

We estimated the transverse motion of M31 under the 
assumption that the satellites of M31 follow the motion 
of M31 through space, and under the assumption that 
the galaxies in the outer parts of the Local Group fol- 
low the motion of the Local Group barycenter through 
space. The first method was applied independently to 
a sample of 17 satellites with known line-of-sight veloci- 
ties, and to the 2 satellites M33 and IC 10 with known 
proper motions. The second method was applied to a 
sample of 5 galaxies near the Local Group turn-around 
radius (and we showed that inclusion of 6 more distant 
galaxies does not significantly affect the results). The re- 
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suits from the different methods are mutually consistent 
and successfully recover the known line-of-sight velocity 
of M31. The weighted average heliocentric transverse ve- 
locity of M31 from the different methods in the West and 
North directions are found to be (vw) = — 78±41kms _1 
and (v N ) = -38±34kms- 1 . 

We used the known line-of-sight velocity with the 
newly inferred transverse velocity to determine the ra- 
dial and tangential Galactocentric velocity components 
of M31. We used a Bayesian analysis to obtain the statis- 
tical probability distribution for Vtam properly corrected 
for observationally induced errors and biases. The results 
are that K-ad = —130 ± 8 kms" 1 and that the probabil- 
ity distribution for Vt an has a median value of 42 kms -1 
and a ler confidence interval Vtan < 56kms~ 1 . A purely 
radial orbit is consistent with the data at this confidence 
level. These velocities are consistent with the predictions 
of the most recent action-method models for the history 
of the Local Group, and have small enough errors that 
they can start to meaningfully constrain the full solution 
space allowed by these models. 

We used the M31 velocity vector to constrain the rela- 
tive M31-Milky Way orbit. The minimum mass required 
for a bound orbit is M bindimin = 1.72^°;^ x 10 12 M . If 
the orbit is indeed bound, then the timing argument com- 
bined with the known age of the Universe implies that 
M = 5.58to y2 x 10 12 M . This is consistent with mass 
range suggested by cosmologically motivated models for 
the individual structure and dynamics of M31 and the 
Milky Way, respectively. However, the timing mass is 
on the high end of the allowed range, with the most fa- 
vored models in the study of K02 yielding a Local Group 
mass of only 2.6 x 10 12 M Q . This indicates that the M31 
and the Milky Way are indeed bound, but that their to- 
tal mass could be lower than suggested by the timing 
argument. Li & White (2008) performed a theoretical 
calibration of the timing argument and found that while 
the timing argument mass is unbiased, it does have a 
theoretical dispersion of 41%. Therefore, it is not statis- 
tically implausible that for the specific case of the Milky 
Way-M31 system the timing argument mass could be an 
overestimate by a factor of ~ 2. The method of estimat- 
ing the Local Group mass based on the size of the turn- 
around (zero-velocity) radius yield systematically lower 
masses than other methods, and is in fact not consistent 
with a bound nature for the M31-Milky Way orbit. This 
and other arguments suggest that this method yields in- 
correct masses that are systematically biased low. 

The M31 transverse velocity implies that M33 is in a 
tightly bound orbit around M31. The calculations of L05 
indicate that for 82% of Monte-Carlo samplings from the 
error ellipse, more tidal deformation of M33 would have 
been expected than is observed. However, this assumes 
that M M 3i = 2.6-3.4 x 10 12 M Q . The percentage is lower, 
and the agreement between observational and theoretical 
constraints on the M31 transverse velocity is better, if the 
M31 mass is as low as 1.6 x 10 12 M , as advocated by 
K02. Nonetheless, our results indicate that some tidal 
deformation of M33 could certainly have occurred. So 
it will be worthwhile to perform more deep searches for 
tidal tails and structures in the outer regions of M33. The 
estimates of the Local Group timing mass do not change 



significantly if the theoretical constraints from the L05 
study of M33 structure are strictly enforced. 

The transverse velocity inferred for M31 (the weighted 
average in Table [3]) corresponds to a proper motion 
= — acosS = —21.5 ± 11.1 /iasyr -1 and hn = 5 = 
— 10.4±9.3/xasyr _1 at a distance D = 770 kpc. It will be 
difficult to obtain an actual proper motion measurement 
in the near future that will rival the accuracy of this. The 
most accurate Hubble Space Telescope proper motion 
measurements for any Local Group galaxy (for the Large 
and Small Magellanic Cloud; Kallivayalil et al. 2006a, b) 
have errors of 50-80 yuasyr^ 1 . VLBI measurements of 
water masers can provide errors of only 5-10 /iasyr" 1 
(Brunthaler et al. 2005, 2007). This is actually not much 
better than what we have already presented here. More- 
over, no water masers are known in M31 to which this 
technique can be applied and, at present detection limits, 
may be none should be expected (Brunthaler et al. 2006). 
Therefore, a high accuracy measurement of the proper 
motion of M31 will have to await future generations of 
astrometric satellites. However, even SIM or GAIA mea- 
surements will at best have uncertainties that are only 
a few times smaller than those reported here (see sec- 
tion 12 of Unwin et al. 2007), and moreover, results are 
not expected from these satellites for at least another 
decade. 

It may be possible to improve the results presented 
here by using a larger number of tracers of the M31 ve- 
locity. If the sample of M31 satellites with water maser 
proper motions can be increased from 2 to ~ 8, then 
this would halve the uncertainty in the average result 
from the method in Section l2~2l However, this may not 
be possible. Water masers are often associated with re- 
gions of star formation, whereas most M31 satellites are 
early type dwarf ellipticals or spheroidals (see Table Q}. 
Even in the gas-rich satellites the prospects for new water 
maser discoveries are not good (Brunthaler et al. 2006). 
Therefore, the number of satellites for which water maser 
proper motions can be obtained is fundamentally limited. 
Alternatively, it may be possible to increase the number 
of tracers at large distances from M31 for which line- 
of-sight velocities are available. A significant increase 
would be needed (from 17 to ~ 68) to merely halve the 
uncertainties in our result from Section [2~T1 While new 
satellites of M31 continue to be discovered on a regular 
basis (e.g., Zucker et al. 2007; Ibata et al. 2007), this is 
unlikely to significantly affect the statistics. Individual 
Red Giant Branch stars in the outer halo of M31 may 
be more promising (e.g., Gilbert et al. 2006), since it 
is possible, at least in principle, to obtain line-of-sight 
velocities for large numbers of such stars. 
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